Dataset statistics
| Statistic | Value |
|---|---|
| Number of variables | 18 |
| Number of observations | 23856 |
| Missing cells | 182 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 5.9 MiB |
| Average record size in memory | 259.5 B |
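As a quick sanity check on the dataset statistics, the percentage of missing cells and the approximate record size can be recomputed from the other figures. This is a minimal sketch in plain Python; the variable names are illustrative, and the record size is only approximate because the report's memory total is rounded to one decimal place.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 18
n_observations = 23856
missing_cells = 182
total_memory_mib = 5.9  # rounded in the report

# Every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Approximate bytes per row; the report's 259.5 B comes from the
# unrounded memory total, so this estimate differs slightly.
avg_record_bytes = total_memory_mib * 1024 * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.3f}%")
print(f"approx. record size: {avg_record_bytes:.1f} B")
```

The missing share comes out well below 0.1%, which is why the report shows it as "< 0.1%".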
Variable types
| Type | Count |
|---|---|
| NUM | 15 |
| CAT | 2 |
| BOOL | 1 |
Reproduction
| Field | Value |
|---|---|
| Analysis started | 2020-06-15 14:26:45.827548 |
| Analysis finished | 2020-06-15 14:27:56.042579 |
| Duration | 1 minute and 10.22 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
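The reported duration follows directly from the two timestamps. A minimal sketch using only the standard library (the format string is an assumption inferred from how the timestamps are printed):

```python
from datetime import datetime

# Timestamp layout inferred from the "Reproduction" table.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2020-06-15 14:26:45.827548", FMT)
finished = datetime.strptime("2020-06-15 14:27:56.042579", FMT)

# Elapsed wall-clock time in seconds, then split into minutes and seconds.
elapsed = (finished - started).total_seconds()
minutes, seconds = divmod(elapsed, 60)

print(f"{int(minutes)} minute(s) and {seconds:.2f} seconds")
```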
Warnings
| Message | Type |
|---|---|
| DATE has a high cardinality: 9121 distinct values | High cardinality |
| X_3 is highly correlated with X_2 | High correlation |
| X_2 is highly correlated with X_3 | High correlation |
| X_10 is highly skewed (γ1 = 34.9427132) | Skewed |
| X_12 is highly skewed (γ1 = 30.61908319) | Skewed |
| DATE is uniformly distributed | Uniform |
| INCIDENT_ID has unique values | Unique |
| X_1 has 19036 (79.8%) zeros | Zeros |
| X_4 has 3335 (14.0%) zeros | Zeros |
| X_5 has 4695 (19.7%) zeros | Zeros |
| X_7 has 3461 (14.5%) zeros | Zeros |
| X_8 has 8774 (36.8%) zeros | Zeros |
| X_11 has 2553 (10.7%) zeros | Zeros |
| X_12 has 5171 (21.7%) zeros | Zeros |
| X_14 has 288 (1.2%) zeros | Zeros |
| X_15 has 1017 (4.3%) zeros | Zeros |
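The zero percentages in these warnings are the zero counts divided by the 23856 observations. A small sketch that recomputes them (the dictionary simply restates the counts above; nothing here is read from the original data file):

```python
n_observations = 23856

# Zero counts as listed in the "Zeros" warnings.
zero_counts = {
    "X_1": 19036, "X_4": 3335, "X_5": 4695, "X_7": 3461, "X_8": 8774,
    "X_11": 2553, "X_12": 5171, "X_14": 288, "X_15": 1017,
}

# Share of zeros per variable, rounded to one decimal as in the report.
zero_pct = {
    name: round(100 * count / n_observations, 1)
    for name, count in zero_counts.items()
}

for name, pct in zero_pct.items():
    print(f"{name}: {pct}% zeros")
```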
INCIDENT_ID
Categorical
| Distinct count | 23856 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 186.5 KiB |
| Value | Count | Frequency (%) | |
| CR_92071 | 1 | < 0.1% | |
| CR_74384 | 1 | < 0.1% | |
| CR_180539 | 1 | < 0.1% | |
| CR_109551 | 1 | < 0.1% | |
| CR_101106 | 1 | < 0.1% | |
| CR_45703 | 1 | < 0.1% | |
| CR_81070 | 1 | < 0.1% | |
| CR_47357 | 1 | < 0.1% | |
| CR_119487 | 1 | < 0.1% | |
| CR_98161 | 1 | < 0.1% | |
| CR_181178 | 1 | < 0.1% | |
| CR_145352 | 1 | < 0.1% | |
| CR_123793 | 1 | < 0.1% | |
| CR_161737 | 1 | < 0.1% | |
| CR_118955 | 1 | < 0.1% | |
| CR_21001 | 1 | < 0.1% | |
| CR_19270 | 1 | < 0.1% | |
| CR_76914 | 1 | < 0.1% | |
| CR_184574 | 1 | < 0.1% | |
| CR_113490 | 1 | < 0.1% | |
| CR_114067 | 1 | < 0.1% | |
| CR_150595 | 1 | < 0.1% | |
| CR_168522 | 1 | < 0.1% | |
| CR_155845 | 1 | < 0.1% | |
| CR_137771 | 1 | < 0.1% | |
| Other values (23831) | 23831 | 99.9% |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.44714118 |
| Min length | 4 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 1 | 24042 | 11.9% | |
| C | 23856 | 11.8% | |
| R | 23856 | 11.8% | |
| _ | 23856 | 11.8% | |
| 4 | 12071 | 6.0% | |
| 5 | 12011 | 6.0% | |
| 7 | 11984 | 5.9% | |
| 3 | 11962 | 5.9% | |
| 6 | 11931 | 5.9% | |
| 2 | 11929 | 5.9% | |
| 8 | 11861 | 5.9% | |
| 9 | 11436 | 5.7% | |
| 0 | 10720 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 129947 | 64.5% | |
| Uppercase Letter | 47712 | 23.7% | |
| Connector Punctuation | 23856 | 11.8% |
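The character-category counts tie back to the length statistics: their sum is the total number of characters, and dividing by the number of observations gives the mean length. A minimal sketch under that reading of the tables:

```python
n_observations = 23856

# Counts from the "Most occurring categories" table.
category_counts = {
    "Decimal Number": 129947,
    "Uppercase Letter": 47712,
    "Connector Punctuation": 23856,
}

# Total characters across all identifier strings.
total_chars = sum(category_counts.values())
mean_length = total_chars / n_observations

# Share of each category, rounded to one decimal as in the report.
shares = {
    name: round(100 * count / total_chars, 1)
    for name, count in category_counts.items()
}

print(total_chars, round(mean_length, 8), shares)
```

The two uppercase letters and the single underscore per value account for the fixed `CR_` prefix; only the digit count varies between rows.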
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| C | 23856 | 50.0% | |
| R | 23856 | 50.0% |
Most frequent Connector Punctuation characters
| Value | Count | Frequency (%) | |
| _ | 23856 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 24042 | 18.5% | |
| 4 | 12071 | 9.3% | |
| 5 | 12011 | 9.2% | |
| 7 | 11984 | 9.2% | |
| 3 | 11962 | 9.2% | |
| 6 | 11931 | 9.2% | |
| 2 | 11929 | 9.2% | |
| 8 | 11861 | 9.1% | |
| 9 | 11436 | 8.8% | |
| 0 | 10720 | 8.2% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 153803 | 76.3% | |
| Latin | 47712 | 23.7% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| C | 23856 | 50.0% | |
| R | 23856 | 50.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1 | 24042 | 15.6% | |
| _ | 23856 | 15.5% | |
| 4 | 12071 | 7.8% | |
| 5 | 12011 | 7.8% | |
| 7 | 11984 | 7.8% | |
| 3 | 11962 | 7.8% | |
| 6 | 11931 | 7.8% | |
| 2 | 11929 | 7.8% | |
| 8 | 11861 | 7.7% | |
| 9 | 11436 | 7.4% | |
| 0 | 10720 | 7.0% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 201515 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 1 | 24042 | 11.9% | |
| C | 23856 | 11.8% | |
| R | 23856 | 11.8% | |
| _ | 23856 | 11.8% | |
| 4 | 12071 | 6.0% | |
| 5 | 12011 | 6.0% | |
| 7 | 11984 | 5.9% | |
| 3 | 11962 | 5.9% | |
| 6 | 11931 | 5.9% | |
| 2 | 11929 | 5.9% | |
| 8 | 11861 | 5.9% | |
| 9 | 11436 | 5.7% | |
| 0 | 10720 | 5.3% |
DATE
Categorical
| Distinct count | 9121 |
|---|---|
| Unique (%) | 38.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 186.5 KiB |
| Value | Count | Frequency (%) | |
| 12-SEP-01 | 22 | 0.1% | |
| 13-SEP-01 | 20 | 0.1% | |
| 17-SEP-01 | 17 | 0.1% | |
| 15-SEP-01 | 15 | 0.1% | |
| 11-SEP-01 | 15 | 0.1% | |
| 26-SEP-01 | 13 | 0.1% | |
| 16-SEP-01 | 12 | 0.1% | |
| 14-AUG-05 | 11 | < 0.1% | |
| 19-SEP-01 | 11 | < 0.1% | |
| 28-SEP-01 | 11 | < 0.1% | |
| 20-SEP-01 | 11 | < 0.1% | |
| 18-SEP-01 | 11 | < 0.1% | |
| 02-NOV-00 | 11 | < 0.1% | |
| 18-NOV-16 | 10 | < 0.1% | |
| 30-JUN-06 | 10 | < 0.1% | |
| 14-SEP-01 | 10 | < 0.1% | |
| 31-AUG-06 | 9 | < 0.1% | |
| 18-APR-06 | 9 | < 0.1% | |
| 23-SEP-01 | 9 | < 0.1% | |
| 17-MAY-01 | 9 | < 0.1% | |
| 01-MAY-92 | 9 | < 0.1% | |
| 04-FEB-12 | 9 | < 0.1% | |
| 13-JUL-00 | 9 | < 0.1% | |
| 12-NOV-16 | 9 | < 0.1% | |
| 24-SEP-07 | 9 | < 0.1% | |
| Other values (9096) | 23565 | 98.8% |
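The sample values suggest that DATE is stored as text in a DD-MON-YY layout, which would explain why it is profiled as a high-cardinality categorical variable. A hedged sketch of parsing such strings with the standard library; the format string and the helper name `parse_report_date` are assumptions, and `%b` relies on English month abbreviations in the active locale:

```python
from datetime import date, datetime


def parse_report_date(text: str) -> date:
    """Parse a DD-MON-YY string such as '12-SEP-01' (format assumed)."""
    # strptime matches month abbreviations case-insensitively; two-digit
    # years 69-99 map to 1969-1999 and 00-68 map to 2000-2068.
    return datetime.strptime(text, "%d-%b-%y").date()


samples = ["12-SEP-01", "01-MAY-92", "18-NOV-16"]
parsed = [parse_report_date(s) for s in samples]
print(parsed)
```

Converting the column to real dates before profiling would likely let the tool report a date range instead of character statistics.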
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Most occurring characters
| Value | Count | Frequency (%) | |
| - | 47712 | 22.2% | |
| 0 | 20551 | 9.6% | |
| 1 | 20253 | 9.4% | |
| 2 | 12493 | 5.8% | |
| 9 | 11558 | 5.4% | |
| A | 10095 | 4.7% | |
| U | 6380 | 3.0% | |
| 3 | 6078 | 2.8% | |
| J | 6009 | 2.8% | |
| N | 5705 | 2.7% | |
| E | 5500 | 2.6% | |
| 7 | 5121 | 2.4% | |
| 6 | 5007 | 2.3% | |
| 8 | 4997 | 2.3% | |
| 5 | 4758 | 2.2% | |
| 4 | 4608 | 2.1% | |
| P | 4404 | 2.1% | |
| M | 4132 | 1.9% | |
| R | 4104 | 1.9% | |
| O | 3991 | 1.9% | |
| C | 3633 | 1.7% | |
| S | 2290 | 1.1% | |
| L | 2157 | 1.0% | |
| Y | 2142 | 1.0% | |
| T | 2138 | 1.0% | |
| Other values (5) | 8888 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 95424 | 44.4% | |
| Uppercase Letter | 71568 | 33.3% | |
| Dash Punctuation | 47712 | 22.2% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 20551 | 21.5% | |
| 1 | 20253 | 21.2% | |
| 2 | 12493 | 13.1% | |
| 9 | 11558 | 12.1% | |
| 3 | 6078 | 6.4% | |
| 7 | 5121 | 5.4% | |
| 6 | 5007 | 5.2% | |
| 8 | 4997 | 5.2% | |
| 5 | 4758 | 5.0% | |
| 4 | 4608 | 4.8% |
Most frequent Dash Punctuation characters
| Value | Count | Frequency (%) | |
| - | 47712 | 100.0% |
Most frequent Uppercase Letter characters
| Value | Count | Frequency (%) | |
| A | 10095 | 14.1% | |
| U | 6380 | 8.9% | |
| J | 6009 | 8.4% | |
| N | 5705 | 8.0% | |
| E | 5500 | 7.7% | |
| P | 4404 | 6.2% | |
| M | 4132 | 5.8% | |
| R | 4104 | 5.7% | |
| O | 3991 | 5.6% | |
| C | 3633 | 5.1% | |
| S | 2290 | 3.2% | |
| L | 2157 | 3.0% | |
| Y | 2142 | 3.0% | |
| T | 2138 | 3.0% | |
| G | 2110 | 2.9% | |
| V | 1853 | 2.6% | |
| F | 1715 | 2.4% | |
| B | 1715 | 2.4% | |
| D | 1495 | 2.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 143136 | 66.7% | |
| Latin | 71568 | 33.3% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| - | 47712 | 33.3% | |
| 0 | 20551 | 14.4% | |
| 1 | 20253 | 14.1% | |
| 2 | 12493 | 8.7% | |
| 9 | 11558 | 8.1% | |
| 3 | 6078 | 4.2% | |
| 7 | 5121 | 3.6% | |
| 6 | 5007 | 3.5% | |
| 8 | 4997 | 3.5% | |
| 5 | 4758 | 3.3% | |
| 4 | 4608 | 3.2% |
Most frequent Latin characters
| Value | Count | Frequency (%) | |
| A | 10095 | 14.1% | |
| U | 6380 | 8.9% | |
| J | 6009 | 8.4% | |
| N | 5705 | 8.0% | |
| E | 5500 | 7.7% | |
| P | 4404 | 6.2% | |
| M | 4132 | 5.8% | |
| R | 4104 | 5.7% | |
| O | 3991 | 5.6% | |
| C | 3633 | 5.1% | |
| S | 2290 | 3.2% | |
| L | 2157 | 3.0% | |
| Y | 2142 | 3.0% | |
| T | 2138 | 3.0% | |
| G | 2110 | 2.9% | |
| V | 1853 | 2.6% | |
| F | 1715 | 2.4% | |
| B | 1715 | 2.4% | |
| D | 1495 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 214704 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| - | 47712 | 22.2% | |
| 0 | 20551 | 9.6% | |
| 1 | 20253 | 9.4% | |
| 2 | 12493 | 5.8% | |
| 9 | 11558 | 5.4% | |
| A | 10095 | 4.7% | |
| U | 6380 | 3.0% | |
| 3 | 6078 | 2.8% | |
| J | 6009 | 2.8% | |
| N | 5705 | 2.7% | |
| E | 5500 | 2.6% | |
| 7 | 5121 | 2.4% | |
| 6 | 5007 | 2.3% | |
| 8 | 4997 | 2.3% | |
| 5 | 4758 | 2.2% | |
| 4 | 4608 | 2.1% | |
| P | 4404 | 2.1% | |
| M | 4132 | 1.9% | |
| R | 4104 | 1.9% | |
| O | 3991 | 1.9% | |
| C | 3633 | 1.7% | |
| S | 2290 | 1.1% | |
| L | 2157 | 1.0% | |
| Y | 2142 | 1.0% | |
| T | 2138 | 1.0% | |
| Other values (5) | 8888 | 4.1% |
X_1
Real number (ℝ≥0)
| Distinct count | 8 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4837776659959759 |
|---|---|
| Minimum | 0 |
| Maximum | 7 |
| Zeros | 19036 |
| Zeros (%) | 79.8% |
| Memory size | 186.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 3 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.439737889 |
|---|---|
| Coefficient of variation (CV) | 2.976032152 |
| Kurtosis | 13.65891063 |
| Mean | 0.483777666 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.789307148 |
| Sum | 11541 |
| Variance | 2.072845188 |
| Value | Count | Frequency (%) | |
| 0 | 19036 | 79.8% | |
| 1 | 3497 | 14.7% | |
| 7 | 876 | 3.7% | |
| 5 | 270 | 1.1% | |
| 3 | 136 | 0.6% | |
| 4 | 26 | 0.1% | |
| 2 | 10 | < 0.1% | |
| 6 | 5 | < 0.1% |
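Because this variable has only eight distinct values, its sum, mean, variance and coefficient of variation can be reproduced from the value counts alone. The sketch below assumes the report uses the sample variance (divisor n - 1), which is the pandas default; that convention is not stated in the report itself.

```python
from math import sqrt

# Value -> count, copied from the frequency table above.
value_counts = {0: 19036, 1: 3497, 7: 876, 5: 270, 3: 136, 4: 26, 2: 10, 6: 5}

n = sum(value_counts.values())
total = sum(value * count for value, count in value_counts.items())
mean = total / n

# Sample variance (ddof=1), assumed to match the report's convention.
variance = sum(
    count * (value - mean) ** 2 for value, count in value_counts.items()
) / (n - 1)
std = sqrt(variance)

# Coefficient of variation: spread relative to the mean.
cv = std / mean

print(n, total, round(mean, 6), round(variance, 6), round(cv, 6))
```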
| Value | Count | Frequency (%) | |
| 0 | 19036 | 79.8% | |
| 1 | 3497 | 14.7% | |
| 2 | 10 | < 0.1% | |
| 3 | 136 | 0.6% | |
| 4 | 26 | 0.1% | |
| 5 | 270 | 1.1% | |
| 6 | 5 | < 0.1% | |
| 7 | 876 | 3.7% |
| Value | Count | Frequency (%) | |
| 7 | 876 | 3.7% | |
| 6 | 5 | < 0.1% | |
| 5 | 270 | 1.1% | |
| 4 | 26 | 0.1% | |
| 3 | 136 | 0.6% | |
| 2 | 10 | < 0.1% | |
| 1 | 3497 | 14.7% | |
| 0 | 19036 | 79.8% |
X_2
Real number (ℝ≥0)
| Distinct count | 52 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.791205566733737 |
|---|---|
| Minimum | 0 |
| Maximum | 52 |
| Zeros | 22 |
| Zeros (%) | 0.1% |
| Memory size | 186.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 7 |
| median | 24 |
| Q3 | 36 |
| 95-th percentile | 49 |
| Maximum | 52 |
| Range | 52 |
| Interquartile range (IQR) | 29 |
Descriptive statistics
| Standard deviation | 15.24023098 |
|---|---|
| Coefficient of variation (CV) | 0.6147434395 |
| Kurtosis | -1.30551524 |
| Mean | 24.79120557 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.0947521072 |
| Sum | 591419 |
| Variance | 232.2646403 |
| Value | Count | Frequency (%) | |
| 4 | 4029 | 16.9% | |
| 36 | 2232 | 9.4% | |
| 33 | 2174 | 9.1% | |
| 24 | 1344 | 5.6% | |
| 21 | 1254 | 5.3% | |
| 37 | 962 | 4.0% | |
| 49 | 927 | 3.9% | |
| 45 | 908 | 3.8% | |
| 3 | 778 | 3.3% | |
| 22 | 672 | 2.8% | |
| 47 | 641 | 2.7% | |
| 16 | 631 | 2.6% | |
| 9 | 593 | 2.5% | |
| 39 | 513 | 2.2% | |
| 25 | 499 | 2.1% | |
| 5 | 437 | 1.8% | |
| 6 | 434 | 1.8% | |
| 44 | 428 | 1.8% | |
| 40 | 385 | 1.6% | |
| 19 | 370 | 1.6% | |
| 26 | 356 | 1.5% | |
| 30 | 266 | 1.1% | |
| 42 | 238 | 1.0% | |
| 17 | 238 | 1.0% | |
| 18 | 210 | 0.9% | |
| Other values (27) | 2337 | 9.8% |
| Value | Count | Frequency (%) | |
| 0 | 22 | 0.1% | |
| 1 | 20 | 0.1% | |
| 2 | 116 | 0.5% | |
| 3 | 778 | 3.3% | |
| 4 | 4029 | 16.9% | |
| 5 | 437 | 1.8% | |
| 6 | 434 | 1.8% | |
| 7 | 166 | 0.7% | |
| 8 | 104 | 0.4% | |
| 9 | 593 | 2.5% |
| Value | Count | Frequency (%) | |
| 52 | 19 | 0.1% | |
| 51 | 103 | 0.4% | |
| 50 | 160 | 0.7% | |
| 49 | 927 | 3.9% | |
| 48 | 55 | 0.2% | |
| 47 | 641 | 2.7% | |
| 46 | 181 | 0.8% | |
| 45 | 908 | 3.8% | |
| 44 | 428 | 1.8% | |
| 43 | 69 | 0.3% |
X_3
Real number (ℝ≥0)
| Distinct count | 52 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.637449698189133 |
|---|---|
| Minimum | 0 |
| Maximum | 52 |
| Zeros | 20 |
| Zeros (%) | 0.1% |
| Memory size | 186.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 8 |
| median | 24 |
| Q3 | 35 |
| 95-th percentile | 49 |
| Maximum | 52 |
| Range | 52 |
| Interquartile range (IQR) | 27 |
Descriptive statistics
| Standard deviation | 15.1350925 |
|---|---|
| Coefficient of variation (CV) | 0.6143124669 |
| Kurtosis | -1.237143987 |
| Mean | 24.6374497 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.08212039854 |
| Sum | 587751 |
| Variance | 229.071025 |
| Value | Count | Frequency (%) | |
| 4 | 4029 | 16.9% | |
| 34 | 2232 | 9.4% | |
| 32 | 2174 | 9.1% | |
| 24 | 1344 | 5.6% | |
| 23 | 1254 | 5.3% | |
| 37 | 962 | 4.0% | |
| 49 | 927 | 3.9% | |
| 45 | 908 | 3.8% | |
| 2 | 778 | 3.3% | |
| 22 | 672 | 2.8% | |
| 48 | 641 | 2.7% | |
| 15 | 631 | 2.6% | |
| 10 | 593 | 2.5% | |
| 39 | 513 | 2.2% | |
| 25 | 499 | 2.1% | |
| 5 | 437 | 1.8% | |
| 6 | 434 | 1.8% | |
| 44 | 428 | 1.8% | |
| 40 | 385 | 1.6% | |
| 19 | 370 | 1.6% | |
| 27 | 356 | 1.5% | |
| 35 | 266 | 1.1% | |
| 42 | 238 | 1.0% | |
| 16 | 238 | 1.0% | |
| 18 | 210 | 0.9% | |
| Other values (27) | 2337 | 9.8% |
| Value | Count | Frequency (%) | |
| 0 | 20 | 0.1% | |
| 1 | 22 | 0.1% | |
| 2 | 778 | 3.3% | |
| 3 | 116 | 0.5% | |
| 4 | 4029 | 16.9% | |
| 5 | 437 | 1.8% | |
| 6 | 434 | 1.8% | |
| 7 | 104 | 0.4% | |
| 8 | 166 | 0.7% | |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 52 | 19 | 0.1% | |
| 51 | 160 | 0.7% | |
| 50 | 103 | 0.4% | |
| 49 | 927 | 3.9% | |
| 48 | 641 | 2.7% | |
| 47 | 55 | 0.2% | |
| 46 | 181 | 0.8% | |
| 45 | 908 | 3.8% | |
| 44 | 428 | 1.8% | |
| 43 | 69 | 0.3% |
X_4
Real number (ℝ≥0)
| Distinct count | 10 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.276743796109994 |
|---|---|
| Minimum | 0 |
| Maximum | 10 |
| Zeros | 3335 |
| Zeros (%) | 14.0% |
| Memory size | 186.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.944672067 |
|---|---|
| Coefficient of variation (CV) | 0.6885313238 |
| Kurtosis | -1.013239087 |
| Mean | 4.276743796 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.1833932631 |
| Sum | 102026 |
| Variance | 8.671093584 |
| Value | Count | Frequency (%) | |
| 6 | 5497 | 23.0% | |
| 2 | 4791 | 20.1% | |
| 0 | 3335 | 14.0% | |
| 7 | 2890 | 12.1% | |
| 4 | 2027 | 8.5% | |
| 3 | 1871 | 7.8% | |
| 9 | 1360 | 5.7% | |
| 10 | 1242 | 5.2% | |
| 1 | 841 | 3.5% | |
| 5 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 3335 | 14.0% | |
| 1 | 841 | 3.5% | |
| 2 | 4791 | 20.1% | |
| 3 | 1871 | 7.8% | |
| 4 | 2027 | 8.5% | |
| 5 | 2 | < 0.1% | |
| 6 | 5497 | 23.0% | |
| 7 | 2890 | 12.1% | |
| 9 | 1360 | 5.7% | |
| 10 | 1242 | 5.2% |
| Value | Count | Frequency (%) | |
| 10 | 1242 | 5.2% | |
| 9 | 1360 | 5.7% | |
| 7 | 2890 | 12.1% | |
| 6 | 5497 | 23.0% | |
| 5 | 2 | < 0.1% | |
| 4 | 2027 | 8.5% | |
| 3 | 1871 | 7.8% | |
| 2 | 4791 | 20.1% | |
| 1 | 841 | 3.5% | |
| 0 | 3335 | 14.0% |
X_5
Real number (ℝ≥0)
| Distinct count | 5 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.4556086519114686 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 4695 |
| Zeros (%) | 19.7% |
| Memory size | 186.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 1.963094729 |
|---|---|
| Coefficient of variation (CV) | 0.7994330562 |
| Kurtosis | -1.558871205 |
| Mean | 2.455608652 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.1752310231 |
| Sum | 58581 |
| Variance | 3.853740916 |
| Value | Count | Frequency (%) | |
| 5 | 7368 | 30.9% | |
| 1 | 6818 | 28.6% | |
| 3 | 4973 | 20.8% | |
| 0 | 4695 | 19.7% | |
| 2 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 4695 | 19.7% | |
| 1 | 6818 | 28.6% | |
| 2 | 2 | < 0.1% | |
| 3 | 4973 | 20.8% | |
| 5 | 7368 | 30.9% |
| Value | Count | Frequency (%) | |
| 5 | 7368 | 30.9% | |
| 3 | 4973 | 20.8% | |
| 2 | 2 | < 0.1% | |
| 1 | 6818 | 28.6% | |
| 0 | 4695 | 19.7% |
X_6
Real number (ℝ≥0)
| Distinct count | 19 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.154175050301811 |
|---|---|
| Minimum | 1 |
| Maximum | 19 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 186.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 5 |
| Q3 | 8 |
| 95-th percentile | 15 |
| Maximum | 19 |
| Range | 18 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 4.471756047 |
|---|---|
| Coefficient of variation (CV) | 0.7266215229 |
| Kurtosis | 0.03760850344 |
| Mean | 6.15417505 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.9608294397 |
| Sum | 146814 |
| Variance | 19.99660214 |
| Value | Count | Frequency (%) | |
| 1 | 3461 | 14.5% | |
| 5 | 2679 | 11.2% | |
| 6 | 2629 | 11.0% | |
| 4 | 2319 | 9.7% | |
| 15 | 2318 | 9.7% | |
| 2 | 2298 | 9.6% | |
| 7 | 2286 | 9.6% | |
| 3 | 1708 | 7.2% | |
| 8 | 1405 | 5.9% | |
| 9 | 1267 | 5.3% | |
| 16 | 620 | 2.6% | |
| 12 | 210 | 0.9% | |
| 11 | 200 | 0.8% | |
| 18 | 162 | 0.7% | |
| 13 | 139 | 0.6% | |
| 17 | 110 | 0.5% | |
| 10 | 25 | 0.1% | |
| 14 | 18 | 0.1% | |
| 19 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 3461 | 14.5% | |
| 2 | 2298 | 9.6% | |
| 3 | 1708 | 7.2% | |
| 4 | 2319 | 9.7% | |
| 5 | 2679 | 11.2% | |
| 6 | 2629 | 11.0% | |
| 7 | 2286 | 9.6% | |
| 8 | 1405 | 5.9% | |
| 9 | 1267 | 5.3% | |
| 10 | 25 | 0.1% |
| Value | Count | Frequency (%) | |
| 19 | 2 | < 0.1% | |
| 18 | 162 | 0.7% | |
| 17 | 110 | 0.5% | |
| 16 | 620 | 2.6% | |
| 15 | 2318 | 9.7% | |
| 14 | 18 | 0.1% | |
| 13 | 139 | 0.6% | |
| 12 | 210 | 0.9% | |
| 11 | 200 | 0.8% | |
| 10 | 25 | 0.1% |
X_7
Real number (ℝ≥0)
| Distinct count | 19 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.876509054325956 |
|---|---|
| Minimum | 0 |
| Maximum | 18 |
| Zeros | 3461 |
| Zeros (%) | 14.5% |
| Memory size | 186.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 4 |
| Q3 | 7 |
| 95-th percentile | 12 |
| Maximum | 18 |
| Range | 18 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.881930665 |
|---|---|
| Coefficient of variation (CV) | 0.7960470538 |
| Kurtosis | 0.493689765 |
| Mean | 4.876509054 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.7961675929 |
| Sum | 116334 |
| Variance | 15.06938569 |
| Value | Count | Frequency (%) | |
| 0 | 3461 | 14.5% | |
| 6 | 2679 | 11.2% | |
| 4 | 2629 | 11.0% | |
| 2 | 2319 | 9.7% | |
| 10 | 2318 | 9.7% | |
| 7 | 2298 | 9.6% | |
| 1 | 2286 | 9.6% | |
| 5 | 1708 | 7.2% | |
| 3 | 1405 | 5.9% | |
| 8 | 1267 | 5.3% | |
| 12 | 620 | 2.6% | |
| 16 | 210 | 0.9% | |
| 17 | 200 | 0.8% | |
| 13 | 162 | 0.7% | |
| 18 | 139 | 0.6% | |
| 11 | 110 | 0.5% | |
| 15 | 25 | 0.1% | |
| 14 | 18 | 0.1% | |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 3461 | 14.5% | |
| 1 | 2286 | 9.6% | |
| 2 | 2319 | 9.7% | |
| 3 | 1405 | 5.9% | |
| 4 | 2629 | 11.0% | |
| 5 | 1708 | 7.2% | |
| 6 | 2679 | 11.2% | |
| 7 | 2298 | 9.6% | |
| 8 | 1267 | 5.3% | |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 18 | 139 | 0.6% | |
| 17 | 200 | 0.8% | |
| 16 | 210 | 0.9% | |
| 15 | 25 | 0.1% | |
| 14 | 18 | 0.1% | |
| 13 | 162 | 0.7% | |
| 12 | 620 | 2.6% | |
| 11 | 110 | 0.5% | |
| 10 | 2318 | 9.7% | |
| 9 | 2 | < 0.1% |
X_8
Real number (ℝ≥0)
| Distinct count | 24 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.9724597585513078 |
|---|---|
| Minimum | 0 |
| Maximum | 99 |
| Zeros | 8774 |
| Zeros (%) | 36.8% |
| Memory size | 186.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 99 |
| Range | 99 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.453144468 |
|---|---|
| Coefficient of variation (CV) | 1.494297789 |
| Kurtosis | 952.9615467 |
| Mean | 0.9724597586 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 17.70384903 |
| Sum | 23199 |
| Variance | 2.111628843 |
| Value | Count | Frequency (%) | |
| 1 | 11010 | 46.2% | |
| 0 | 8774 | 36.8% | |
| 2 | 2268 | 9.5% | |
| 3 | 967 | 4.1% | |
| 4 | 404 | 1.7% | |
| 5 | 207 | 0.9% | |
| 6 | 79 | 0.3% | |
| 7 | 33 | 0.1% | |
| 8 | 32 | 0.1% | |
| 10 | 23 | 0.1% | |
| 9 | 16 | 0.1% | |
| 15 | 11 | < 0.1% | |
| 11 | 8 | < 0.1% | |
| 12 | 8 | < 0.1% | |
| 20 | 4 | < 0.1% | |
| 13 | 2 | < 0.1% | |
| 14 | 2 | < 0.1% | |
| 16 | 2 | < 0.1% | |
| 30 | 1 | < 0.1% | |
| 21 | 1 | < 0.1% | |
| 22 | 1 | < 0.1% | |
| 99 | 1 | < 0.1% | |
| 50 | 1 | < 0.1% | |
| 29 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 8774 | 36.8% | |
| 1 | 11010 | 46.2% | |
| 2 | 2268 | 9.5% | |
| 3 | 967 | 4.1% | |
| 4 | 404 | 1.7% | |
| 5 | 207 | 0.9% | |
| 6 | 79 | 0.3% | |
| 7 | 33 | 0.1% | |
| 8 | 32 | 0.1% | |
| 9 | 16 | 0.1% |
| Value | Count | Frequency (%) | |
| 99 | 1 | < 0.1% | |
| 50 | 1 | < 0.1% | |
| 30 | 1 | < 0.1% | |
| 29 | 1 | < 0.1% | |
| 22 | 1 | < 0.1% | |
| 21 | 1 | < 0.1% | |
| 20 | 4 | < 0.1% | |
| 16 | 2 | < 0.1% | |
| 15 | 11 | < 0.1% | |
| 14 | 2 | < 0.1% |
X_9
Real number (ℝ≥0)
| Distinct count | 7 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.924128101945003 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 118 |
| Zeros (%) | 0.5% |
| Memory size | 186.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 5 |
| median | 5 |
| Q3 | 6 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.362624612 |
|---|---|
| Coefficient of variation (CV) | 0.276724038 |
| Kurtosis | 1.28166232 |
| Mean | 4.924128102 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -1.525286754 |
| Sum | 117470 |
| Variance | 1.856745834 |
| Value | Count | Frequency (%) | |
| 5 | 10559 | 44.3% | |
| 6 | 9508 | 39.9% | |
| 2 | 3040 | 12.7% | |
| 3 | 452 | 1.9% | |
| 1 | 175 | 0.7% | |
| 0 | 118 | 0.5% | |
| 4 | 4 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 118 | 0.5% | |
| 1 | 175 | 0.7% | |
| 2 | 3040 | 12.7% | |
| 3 | 452 | 1.9% | |
| 4 | 4 | < 0.1% | |
| 5 | 10559 | 44.3% | |
| 6 | 9508 | 39.9% |
| Value | Count | Frequency (%) | |
| 6 | 9508 | 39.9% | |
| 5 | 10559 | 44.3% | |
| 4 | 4 | < 0.1% | |
| 3 | 452 | 1.9% | |
| 2 | 3040 | 12.7% | |
| 1 | 175 | 0.7% | |
| 0 | 118 | 0.5% |
X_10
Real number (ℝ≥0)
| Distinct count | 24 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.244802146210597 |
|---|---|
| Minimum | 1 |
| Maximum | 90 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 186.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 90 |
| Range | 89 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.119300682 |
|---|---|
| Coefficient of variation (CV) | 0.8991795888 |
| Kurtosis | 2190.137157 |
| Mean | 1.244802146 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 34.9427132 |
| Sum | 29696 |
| Variance | 1.252834017 |
| Value | Count | Frequency (%) | |
| 1 | 20198 | 84.7% | |
| 2 | 2695 | 11.3% | |
| 3 | 549 | 2.3% | |
| 4 | 225 | 0.9% | |
| 5 | 71 | 0.3% | |
| 6 | 54 | 0.2% | |
| 8 | 15 | 0.1% | |
| 10 | 14 | 0.1% | |
| 9 | 7 | < 0.1% | |
| 7 | 7 | < 0.1% | |
| 11 | 4 | < 0.1% | |
| 12 | 3 | < 0.1% | |
| 20 | 2 | < 0.1% | |
| 15 | 2 | < 0.1% | |
| 30 | 1 | < 0.1% | |
| 22 | 1 | < 0.1% | |
| 19 | 1 | < 0.1% | |
| 40 | 1 | < 0.1% | |
| 50 | 1 | < 0.1% | |
| 18 | 1 | < 0.1% | |
| 58 | 1 | < 0.1% | |
| 17 | 1 | < 0.1% | |
| 90 | 1 | < 0.1% | |
| 16 | 1 | < 0.1% |
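The skewness that triggers the "Skewed" warning for this variable (γ1 = 34.9427132) can be approximated from the value counts above. The sketch assumes the report uses the bias-adjusted Fisher-Pearson estimator, as pandas' `Series.skew` does; that choice is an assumption rather than something the report states.

```python
from math import sqrt

# Value -> count, copied from the frequency table above.
value_counts = {
    1: 20198, 2: 2695, 3: 549, 4: 225, 5: 71, 6: 54, 8: 15, 10: 14,
    9: 7, 7: 7, 11: 4, 12: 3, 20: 2, 15: 2, 30: 1, 22: 1, 19: 1,
    40: 1, 50: 1, 18: 1, 58: 1, 17: 1, 90: 1, 16: 1,
}

n = sum(value_counts.values())
mean = sum(v * c for v, c in value_counts.items()) / n

# Second and third central moments (population form).
m2 = sum(c * (v - mean) ** 2 for v, c in value_counts.items()) / n
m3 = sum(c * (v - mean) ** 3 for v, c in value_counts.items()) / n

# Biased skewness g1, then the small-sample adjustment.
g1 = m3 / m2 ** 1.5
skew = g1 * sqrt(n * (n - 1)) / (n - 2)

print(n, round(mean, 6), round(skew, 3))
```

A handful of rows with values between 30 and 90 dominate the third moment, which is why the skewness is so large even though 84.7% of the rows equal 1.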
| Value | Count | Frequency (%) | |
| 1 | 20198 | 84.7% | |
| 2 | 2695 | 11.3% | |
| 3 | 549 | 2.3% | |
| 4 | 225 | 0.9% | |
| 5 | 71 | 0.3% | |
| 6 | 54 | 0.2% | |
| 7 | 7 | < 0.1% | |
| 8 | 15 | 0.1% | |
| 9 | 7 | < 0.1% | |
| 10 | 14 | 0.1% |
| Value | Count | Frequency (%) | |
| 90 | 1 | < 0.1% | |
| 58 | 1 | < 0.1% | |
| 50 | 1 | < 0.1% | |
| 40 | 1 | < 0.1% | |
| 30 | 1 | < 0.1% | |
| 22 | 1 | < 0.1% | |
| 20 | 2 | < 0.1% | |
| 19 | 1 | < 0.1% | |
| 18 | 1 | < 0.1% | |
| 17 | 1 | < 0.1% |
X_11
Real number (ℝ≥0)
| Distinct count | 133 |
|---|---|
| Unique (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 206.95451877934272 |
|---|---|
| Minimum | 0 |
| Maximum | 332 |
| Zeros | 2553 |
| Zeros (%) | 10.7% |
| Memory size | 186.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 174 |
| median | 249 |
| Q3 | 249 |
| 95-th percentile | 316 |
| Maximum | 332 |
| Range | 332 |
| Interquartile range (IQR) | 75 |
Descriptive statistics
| Standard deviation | 93.03334801 |
|---|---|
| Coefficient of variation (CV) | 0.4495352339 |
| Kurtosis | 0.1944049511 |
| Mean | 206.9545188 |
| Median Absolute Deviation (MAD) | 67 |
| Skewness | -0.9032002688 |
| Sum | 4937107 |
| Variance | 8655.203842 |
| Value | Count | Frequency (%) | |
| 174 | 7275 | 30.5% | |
| 249 | 6930 | 29.0% | |
| 316 | 4500 | 18.9% | |
| 0 | 2553 | 10.7% | |
| 303 | 438 | 1.8% | |
| 127 | 304 | 1.3% | |
| 74 | 207 | 0.9% | |
| 179 | 206 | 0.9% | |
| 102 | 122 | 0.5% | |
| 263 | 103 | 0.4% | |
| 218 | 98 | 0.4% | |
| 328 | 79 | 0.3% | |
| 290 | 76 | 0.3% | |
| 313 | 68 | 0.3% | |
| 43 | 59 | 0.2% | |
| 200 | 59 | 0.2% | |
| 128 | 58 | 0.2% | |
| 325 | 57 | 0.2% | |
| 21 | 45 | 0.2% | |
| 277 | 45 | 0.2% | |
| 71 | 36 | 0.2% | |
| 231 | 34 | 0.1% | |
| 299 | 30 | 0.1% | |
| 208 | 30 | 0.1% | |
| 330 | 29 | 0.1% | |
| Other values (108) | 415 | 1.7% |
| Value | Count | Frequency (%) | |
| 0 | 2553 | 10.7% | |
| 1 | 3 | < 0.1% | |
| 6 | 1 | < 0.1% | |
| 11 | 5 | < 0.1% | |
| 12 | 1 | < 0.1% | |
| 16 | 2 | < 0.1% | |
| 20 | 1 | < 0.1% | |
| 21 | 45 | 0.2% | |
| 25 | 3 | < 0.1% | |
| 31 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 332 | 3 | < 0.1% | |
| 330 | 29 | 0.1% | |
| 329 | 21 | 0.1% | |
| 328 | 79 | 0.3% | |
| 327 | 1 | < 0.1% | |
| 325 | 57 | 0.2% | |
| 323 | 10 | < 0.1% | |
| 322 | 1 | < 0.1% | |
| 321 | 7 | < 0.1% | |
| 320 | 2 | < 0.1% |
X_12
Real number (ℝ≥0)
| Distinct count | 23 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 182 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.974064374419194 |
|---|---|
| Minimum | 0.0 |
| Maximum | 90.0 |
| Zeros | 5171 |
| Zeros (%) | 21.7% |
| Memory size | 186.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 90 |
| Range | 90 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.167725118 |
|---|---|
| Coefficient of variation (CV) | 1.198817192 |
| Kurtosis | 1880.955431 |
| Mean | 0.9740643744 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 30.61908319 |
| Sum | 23060 |
| Variance | 1.363581951 |
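This is the only variable with missing values, so its mean is taken over the non-missing rows rather than over all 23856 observations. A minimal check of that reading, using only figures from the tables above:

```python
n_observations = 23856
n_missing = 182
total = 23060  # "Sum" from the descriptive statistics

# Rows that actually carry a value.
n_valid = n_observations - n_missing

# Mean over non-missing rows, which reproduces the reported mean.
mean = total / n_valid

# Share of missing rows, shown as 0.8% in the report.
missing_pct = 100 * n_missing / n_observations

print(n_valid, round(mean, 6), round(missing_pct, 1))
```

Dividing the sum by all 23856 rows would give about 0.9666 instead, which does not match the reported mean.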
| Value | Count | Frequency (%) | |
| 1 | 15674 | 65.7% | |
| 0 | 5171 | 21.7% | |
| 2 | 2039 | 8.5% | |
| 3 | 476 | 2.0% | |
| 4 | 176 | 0.7% | |
| 5 | 59 | 0.2% | |
| 6 | 36 | 0.2% | |
| 8 | 9 | < 0.1% | |
| 10 | 7 | < 0.1% | |
| 9 | 6 | < 0.1% | |
| 11 | 4 | < 0.1% | |
| 7 | 4 | < 0.1% | |
| 15 | 2 | < 0.1% | |
| 20 | 2 | < 0.1% | |
| 58 | 1 | < 0.1% | |
| 40 | 1 | < 0.1% | |
| 16 | 1 | < 0.1% | |
| 17 | 1 | < 0.1% | |
| 90 | 1 | < 0.1% | |
| 12 | 1 | < 0.1% | |
| 30 | 1 | < 0.1% | |
| 14 | 1 | < 0.1% | |
| 50 | 1 | < 0.1% | |
| (Missing) | 182 | 0.8% |
| Value | Count | Frequency (%) | |
| 0 | 5171 | 21.7% | |
| 1 | 15674 | 65.7% | |
| 2 | 2039 | 8.5% | |
| 3 | 476 | 2.0% | |
| 4 | 176 | 0.7% | |
| 5 | 59 | 0.2% | |
| 6 | 36 | 0.2% | |
| 7 | 4 | < 0.1% | |
| 8 | 9 | < 0.1% | |
| 9 | 6 | < 0.1% |
| Value | Count | Frequency (%) | |
| 90 | 1 | < 0.1% | |
| 58 | 1 | < 0.1% | |
| 50 | 1 | < 0.1% | |
| 40 | 1 | < 0.1% | |
| 30 | 1 | < 0.1% | |
| 20 | 2 | < 0.1% | |
| 17 | 1 | < 0.1% | |
| 16 | 1 | < 0.1% | |
| 15 | 2 | < 0.1% | |
| 14 | 1 | < 0.1% |
X_13
Real number (ℝ≥0)
| Distinct count | 60 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 85.23738262910798 |
|---|---|
| Minimum | 0 |
| Maximum | 116 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 186.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 18 |
| Q1 | 72 |
| median | 98 |
| Q3 | 103 |
| 95-th percentile | 112 |
| Maximum | 116 |
| Range | 116 |
| Interquartile range (IQR) | 31 |
Descriptive statistics
| Standard deviation | 27.59722639 |
|---|---|
| Coefficient of variation (CV) | 0.3237690499 |
| Kurtosis | 1.093046857 |
| Mean | 85.23738263 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | -1.388636749 |
| Sum | 2033423 |
| Variance | 761.6069043 |
| Value | Count | Frequency (%) | |
| 103 | 6995 | 29.3% | |
| 72 | 4476 | 18.8% | |
| 92 | 3255 | 13.6% | |
| 112 | 2116 | 8.9% | |
| 98 | 1366 | 5.7% | |
| 18 | 851 | 3.6% | |
| 109 | 537 | 2.3% | |
| 24 | 523 | 2.2% | |
| 12 | 427 | 1.8% | |
| 59 | 348 | 1.5% | |
| 34 | 342 | 1.4% | |
| 116 | 288 | 1.2% | |
| 54 | 234 | 1.0% | |
| 113 | 225 | 0.9% | |
| 111 | 215 | 0.9% | |
| 67 | 211 | 0.9% | |
| 2 | 210 | 0.9% | |
| 42 | 200 | 0.8% | |
| 48 | 172 | 0.7% | |
| 87 | 150 | 0.6% | |
| 110 | 147 | 0.6% | |
| 84 | 145 | 0.6% | |
| 97 | 76 | 0.3% | |
| 31 | 64 | 0.3% | |
| 89 | 51 | 0.2% | |
| Other values (35) | 232 | 1.0% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 1 | 5 | < 0.1% | |
| 2 | 210 | 0.9% | |
| 7 | 1 | < 0.1% | |
| 8 | 2 | < 0.1% | |
| 9 | 9 | < 0.1% | |
| 10 | 46 | 0.2% | |
| 12 | 427 | 1.8% | |
| 13 | 1 | < 0.1% | |
| 17 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 116 | 288 | 1.2% | |
| 115 | 21 | 0.1% | |
| 114 | 16 | 0.1% | |
| 113 | 225 | 0.9% | |
| 112 | 2116 | 8.9% | |
| 111 | 215 | 0.9% | |
| 110 | 147 | 0.6% | |
| 109 | 537 | 2.3% | |
| 108 | 6 | < 0.1% | |
| 103 | 6995 | 29.3% |
X_14
Real number (ℝ≥0)
| Distinct count | 62 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 72.67429577464789 |
|---|---|
| Minimum | 0 |
| Maximum | 142 |
| Zeros | 288 |
| Zeros (%) | 1.2% |
| Memory size | 186.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 29 |
| Q1 | 29 |
| median | 62 |
| Q3 | 107 |
| 95-th percentile | 142 |
| Maximum | 142 |
| Range | 142 |
| Interquartile range (IQR) | 78 |
Descriptive statistics
| Standard deviation | 43.2973203 |
|---|---|
| Coefficient of variation (CV) | 0.5957721342 |
| Kurtosis | -1.324908795 |
| Mean | 72.67429577 |
| Median Absolute Deviation (MAD) | 33 |
| Skewness | 0.2455877663 |
| Sum | 1733718 |
| Variance | 1874.657945 |
| Value | Count | Frequency (%) | |
| 29 | 8165 | 34.2% | |
| 93 | 3110 | 13.0% | |
| 142 | 2714 | 11.4% | |
| 62 | 2474 | 10.4% | |
| 80 | 1488 | 6.2% | |
| 130 | 1205 | 5.1% | |
| 107 | 734 | 3.1% | |
| 14 | 657 | 2.8% | |
| 119 | 579 | 2.4% | |
| 103 | 506 | 2.1% | |
| 87 | 455 | 1.9% | |
| 133 | 356 | 1.5% | |
| 0 | 288 | 1.2% | |
| 53 | 177 | 0.7% | |
| 138 | 137 | 0.6% | |
| 115 | 130 | 0.5% | |
| 124 | 125 | 0.5% | |
| 6 | 119 | 0.5% | |
| 25 | 77 | 0.3% | |
| 140 | 74 | 0.3% | |
| 136 | 66 | 0.3% | |
| 77 | 57 | 0.2% | |
| 24 | 21 | 0.1% | |
| 76 | 19 | 0.1% | |
| 57 | 18 | 0.1% | |
| Other values (37) | 105 | 0.4% |
| Value | Count | Frequency (%) | |
| 0 | 288 | 1.2% | |
| 2 | 1 | < 0.1% | |
| 6 | 119 | 0.5% | |
| 12 | 1 | < 0.1% | |
| 14 | 657 | 2.8% | |
| 16 | 2 | < 0.1% | |
| 24 | 21 | 0.1% | |
| 25 | 77 | 0.3% | |
| 29 | 8165 | 34.2% | |
| 30 | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 142 | 2714 | 11.4% | |
| 140 | 74 | 0.3% | |
| 139 | 10 | < 0.1% | |
| 138 | 137 | 0.6% | |
| 136 | 66 | 0.3% | |
| 133 | 356 | 1.5% | |
| 130 | 1205 | 5.1% | |
| 129 | 17 | 0.1% | |
| 128 | 4 | < 0.1% | |
| 124 | 125 | 0.5% |
X_15
| Distinct count | 28 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 33.46474681421864 |
|---|---|
| Minimum | 0 |
| Maximum | 50 |
| Zeros | 1017 |
| Zeros (%) | 4.3% |
| Memory size | 186.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 23 |
| Q1 | 34 |
| median | 34 |
| Q3 | 34 |
| 95-th percentile | 46 |
| Maximum | 50 |
| Range | 50 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 8.38683369 |
|---|---|
| Coefficient of variation (CV) | 0.2506169772 |
| Kurtosis | 8.7395923 |
| Mean | 33.46474681 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -2.527453789 |
| Sum | 798335 |
| Variance | 70.33897934 |
| Value | Count | Frequency (%) | |
| 34 | 18947 | 79.4% | |
| 43 | 1503 | 6.3% | |
| 0 | 1017 | 4.3% | |
| 46 | 668 | 2.8% | |
| 23 | 642 | 2.7% | |
| 48 | 521 | 2.2% | |
| 36 | 182 | 0.8% | |
| 50 | 145 | 0.6% | |
| 9 | 92 | 0.4% | |
| 39 | 54 | 0.2% | |
| 24 | 20 | 0.1% | |
| 38 | 20 | 0.1% | |
| 18 | 13 | 0.1% | |
| 40 | 6 | < 0.1% | |
| 41 | 6 | < 0.1% | |
| 17 | 4 | < 0.1% | |
| 4 | 4 | < 0.1% | |
| 15 | 2 | < 0.1% | |
| 32 | 1 | < 0.1% | |
| 16 | 1 | < 0.1% | |
| 31 | 1 | < 0.1% | |
| 35 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% | |
| 21 | 1 | < 0.1% | |
| 8 | 1 | < 0.1% | |
| Other values (3) | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 1017 | 4.3% | |
| 4 | 4 | < 0.1% | |
| 5 | 1 | < 0.1% | |
| 8 | 1 | < 0.1% | |
| 9 | 92 | 0.4% | |
| 12 | 1 | < 0.1% | |
| 14 | 1 | < 0.1% | |
| 15 | 2 | < 0.1% | |
| 16 | 1 | < 0.1% | |
| 17 | 4 | < 0.1% |
| Value | Count | Frequency (%) | |
| 50 | 145 | 0.6% | |
| 48 | 521 | 2.2% | |
| 46 | 668 | 2.8% | |
| 43 | 1503 | 6.3% | |
| 41 | 6 | < 0.1% | |
| 40 | 6 | < 0.1% | |
| 39 | 54 | 0.2% | |
| 38 | 20 | 0.1% | |
| 36 | 182 | 0.8% | |
| 35 | 1 | < 0.1% |
MULTIPLE_OFFENSE
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 186.5 KiB |
| 1 | 22788 |
|---|---|
| 0 | 1068 |
| Value | Count | Frequency (%) | |
| 1 | 22788 | 95.5% | |
| 0 | 1068 | 4.5% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
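
As an illustration of that calculation, here is a minimal pure-Python sketch. The function name `pearson_r` is hypothetical, and the inputs are the X_2 and X_3 values of the ten records shown in the "First rows" table of this report, so the result describes only that small sample, not the full dataset.

```python
import math

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    # The 1/n (or 1/(n-1)) factors cancel between numerator and denominator.
    return cov / math.sqrt(var_x * var_y)

# X_2 and X_3 for the ten "First rows" sample records.
x2 = [36, 37, 3, 33, 33, 45, 30, 8, 49, 4]
x3 = [34, 37, 2, 32, 32, 45, 35, 7, 49, 4]

r = pearson_r(x2, x3)

# Shifting and (positively) rescaling one variable leaves r unchanged.
r_rescaled = pearson_r([2 * v + 5 for v in x2], x3)

print(round(r, 3), round(r_rescaled, 3))
```

On this sample the two columns move almost in lockstep, which is consistent with the high-correlation warning the report raises for X_2 and X_3.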
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
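
The rank-based calculation can be sketched as follows. This is a minimal pure-Python illustration rather than the implementation used by the report; the helper names `average_ranks`, `pearson_r` and `spearman_rho` are hypothetical, and ties are assumed to receive the average of the ranks they span.

```python
import math

def average_ranks(values):
    """Rank values from 1..n; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions are 0-based, ranks are 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson_r(x, y):
    """Covariance divided by the product of the standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def spearman_rho(x, y):
    """Pearson's r applied to the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))

# A monotonic but nonlinear relationship: y = x**3.
x = [1, 2, 3, 4, 5, 6]
y = [v ** 3 for v in x]

rho = spearman_rho(x, y)
r = pearson_r(x, y)
print(rho, round(r, 3))
```

Because y increases whenever x does, the ranks coincide and ρ reaches its maximum, while Pearson's r on the raw values stays below 1.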
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is then the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
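
The pair-counting definition can be written out directly. The sketch below implements exactly the formula stated above (often called tau-a); the function name `kendall_tau_a` is hypothetical. Library implementations frequently use a tie-corrected variant (tau-b), so values computed this way may differ from the report when the data contain many ties.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:        # both variables move in the same direction
            concordant += 1
        elif sign < 0:      # the variables move in opposite directions
            discordant += 1
        # sign == 0 means a tie in x or y; it counts as neither.
    n = len(x)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs

# Four observations with one swapped pair: 5 concordant, 1 discordant.
tau = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])
print(round(tau, 4))
```

This brute-force version inspects every pair and therefore scales quadratically with the number of observations, which is acceptable for illustration but slow for a dataset of this size.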
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available.
First rows
| | INCIDENT_ID | DATE | X_1 | X_2 | X_3 | X_4 | X_5 | X_6 | X_7 | X_8 | X_9 | X_10 | X_11 | X_12 | X_13 | X_14 | X_15 | MULTIPLE_OFFENSE |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | CR_102659 | 04-JUL-04 | 0 | 36 | 34 | 2 | 1 | 5 | 6 | 1 | 6 | 1 | 174 | 1.0 | 92 | 29 | 36 | 0 |
| 1 | CR_189752 | 18-JUL-17 | 1 | 37 | 37 | 0 | 0 | 11 | 17 | 1 | 6 | 1 | 236 | 1.0 | 103 | 142 | 34 | 1 |
| 2 | CR_184637 | 15-MAR-17 | 0 | 3 | 2 | 3 | 5 | 1 | 0 | 2 | 3 | 1 | 174 | 1.0 | 110 | 93 | 34 | 1 |
| 3 | CR_139071 | 13-FEB-09 | 0 | 33 | 32 | 2 | 1 | 7 | 1 | 1 | 6 | 1 | 249 | 1.0 | 72 | 29 | 34 | 1 |
| 4 | CR_109335 | 13-APR-05 | 0 | 33 | 32 | 2 | 1 | 8 | 3 | 0 | 5 | 1 | 174 | 0.0 | 112 | 29 | 43 | 1 |
| 5 | CR_96263 | 07-APR-03 | 0 | 45 | 45 | 10 | 3 | 1 | 0 | 1 | 6 | 1 | 303 | 1.0 | 72 | 62 | 34 | 1 |
| 6 | CR_131400 | 22-JAN-08 | 0 | 30 | 35 | 7 | 3 | 7 | 1 | 0 | 5 | 1 | 174 | 0.0 | 112 | 29 | 43 | 1 |
| 7 | CR_11981 | 14-MAY-93 | 0 | 8 | 7 | 7 | 3 | 9 | 8 | 0 | 5 | 1 | 316 | 1.0 | 72 | 62 | 34 | 1 |
| 8 | CR_184134 | 21-AUG-16 | 0 | 49 | 49 | 6 | 5 | 8 | 3 | 1 | 1 | 1 | 316 | 1.0 | 103 | 14 | 34 | 1 |
| 9 | CR_32634 | 25-AUG-96 | 1 | 4 | 4 | 6 | 5 | 15 | 10 | 0 | 5 | 2 | 145 | 1.0 | 103 | 29 | 34 | 0 |
Last rows
| | INCIDENT_ID | DATE | X_1 | X_2 | X_3 | X_4 | X_5 | X_6 | X_7 | X_8 | X_9 | X_10 | X_11 | X_12 | X_13 | X_14 | X_15 | MULTIPLE_OFFENSE |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 23846 | CR_79724 | 17-SEP-01 | 1 | 36 | 34 | 2 | 1 | 15 | 10 | 0 | 5 | 1 | 249 | 1.0 | 92 | 130 | 34 | 1 |
| 23847 | CR_38033 | 02-MAY-96 | 0 | 33 | 32 | 2 | 1 | 5 | 6 | 1 | 6 | 2 | 249 | 2.0 | 103 | 93 | 34 | 1 |
| 23848 | CR_14384 | 02-DEC-93 | 0 | 26 | 27 | 9 | 0 | 3 | 5 | 0 | 5 | 1 | 249 | 1.0 | 112 | 130 | 34 | 1 |
| 23849 | CR_68953 | 25-APR-00 | 7 | 25 | 25 | 9 | 0 | 9 | 8 | 0 | 5 | 1 | 249 | 1.0 | 72 | 93 | 34 | 1 |
| 23850 | CR_33201 | 11-JUL-96 | 0 | 4 | 4 | 6 | 5 | 1 | 0 | 2 | 6 | 1 | 0 | 1.0 | 72 | 29 | 34 | 1 |
| 23851 | CR_88991 | 11-JAN-02 | 1 | 47 | 48 | 7 | 3 | 15 | 10 | 1 | 5 | 1 | 174 | 0.0 | 98 | 29 | 34 | 1 |
| 23852 | CR_46369 | 05-FEB-97 | 0 | 33 | 32 | 2 | 1 | 5 | 6 | 0 | 5 | 1 | 174 | 0.0 | 112 | 29 | 43 | 1 |
| 23853 | CR_157556 | 03-APR-12 | 0 | 25 | 25 | 9 | 0 | 3 | 5 | 1 | 6 | 1 | 174 | 0.0 | 10 | 29 | 18 | 1 |
| 23854 | CR_103180 | 25-JAN-04 | 0 | 39 | 39 | 6 | 5 | 2 | 7 | 1 | 6 | 1 | 127 | 0.0 | 112 | 103 | 43 | 1 |
| 23855 | CR_22575 | 08-NOV-94 | 7 | 36 | 34 | 2 | 1 | 9 | 8 | 0 | 5 | 1 | 249 | 1.0 | 92 | 29 | 34 | 1 |
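
The DATE values in the sample rows follow a day, abbreviated month, two-digit year pattern such as `04-JUL-04`, and the report treats the column as a high-cardinality categorical. A minimal sketch of parsing such strings into real dates is shown below. It assumes English month abbreviations and a C or English locale; the helper name `parse_incident_date` is hypothetical. Note that Python's `%y` maps 69-99 to 1969-1999 and 00-68 to 2000-2068, which fits the years visible in the sample rows but is an assumption about the rest of the data.

```python
from datetime import datetime

def parse_incident_date(text):
    """Parse a string such as '04-JUL-04' into a datetime.date."""
    # %d = day of month, %b = abbreviated month name, %y = two-digit year.
    return datetime.strptime(text, "%d-%b-%y").date()

# Values taken from the sample rows above.
samples = ["04-JUL-04", "14-MAY-93", "18-JUL-17"]
parsed = [parse_incident_date(s) for s in samples]

for raw, value in zip(samples, parsed):
    print(raw, "->", value.isoformat())
```

Once parsed, the column can be decomposed into numeric features such as year or month instead of being handled as thousands of distinct categories.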